LIMIT LAW OF THE LENGTH OF THE STANDARD RIGHT 
FACTOR OF A LYNDON WORD 
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Abstract. Consider the set of finite words on a totally ordered alpfiabet with 
q letters. We prove that the distribution of the length of the standard right 
factor of a random Lyndon word with length n, divided by n, converges to; 

fj,{dx) = —5i(dx) + Irg i\{x)dx, 

q q ' ' ' 

when n goes to infinity. The convergence of all moments follows. This paper 

completes thus the results of giving the asymptotics of the mean length of 

the standard right factor of a random Lyndon word with length n in the case 

of a two letters alphabet. 



1. Introduction 

Consider a finite totally ordered alphabet A and for each n e N = {1, 2, 3, . . . }, 
denote by the set of words with length n on ^. A Lyndon word with length n 
is a word in which is strictly smaller, for the lexicographic order, than each of 
its proper suffixes. We denote by £„ the set of Lyndon words with length n. 

The standard right factor u of a Lyndon word w is its smallest proper suffix 
for the lexicographic order. Any Lyndon word w can be written uv, in which u is 
a Lyndon word and v is the standard right factor of w. We call uv the standard 
factorization of the Lyndon word w. Lyndon words were introduced by Lyndon 
to build a base of the free Lie algebra over A. The standard factorization plays a 
central part in the building algorithm of this base. For each Lyndon word w, we 
can build a binary tree in the following manner: say that w is the root, and has 
two children, that are the factors u and v of the standard factorization of w. Since 
u and V are still Lyndon words, they can also be divided into two standard factors 
which arc their children and so on (see figure^. Then the average height of these 
trees characterizes the complexity of the building algorithm (see Chen, Fox and 
Lyndon |B] and Lothaire ^Ol)- Thus the informations on the length of the standard 
right factor of a random Lyndon word are essential for the analysis of the building 
algorithm. 

For any Lyndon word w £ £„, let i?„(w) denote the length of its standard right 
factor. Endowing £„ with the uniform probability measure makes i?„ a random 
variable on £„. Bassino, Clement and Nicaud |2], with the help of generating 
functions, prove that the expectation E(i?„), in the case of a two letters alphabet, 
is asymptotically equal to 3n/4. The aim of this paper is to determine the limit 
distribution of as n goes to infinity: 
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Figure 1. Binary tree associated to the standard decomposition 
of the Lyndon word aaaaabbabbbaaab 

Theorem 1.1. For a totally ordered alphabet with q letters, the normalized length 
Rn/n of a random Lyndon word of length n converges in distribution, when n goes 
to infinity, to 



where di denotes the Dirac mass on the point 1, and dx Lebesgue's measure on R. 
All the moments of Rn/n also converge to the corresponding moments of the limit 
distribution. 

Remark. In the case q ~ 2, this result was conjectured by Bassino, Clement and 
Nicaud [2- Simulations were also provided in this paper. 

For the proof of this result, we focus first on the case of a two letters alphabet, 
and then we indicate the way to adapt the proof for the case of q letters. 

Random Lyndon words are, in some sense, conditioned random words. In sec- 
tion[51 we obtain the number of Lyndon words with length n by dividing the number 
of primitive words with length n by n, (the shepherd's principle: counting the legs 
and dividing by four to obtain the number of sheeps). Thus the typical statistical 
behavior of a random word and of a random Lyndon word can be easily linked (see 
lemma I^J. 

Our analysis starts in section |3J we recall, among a number of well known prop- 
erties of random words with length n, those useful for our purposes. In particular, 
we study the number of runs and the length of the longest run of "a", which is 
typically of order log2 n. 

The key step is to prove that the two longest runs of "a" are approximately 
located along the word as two independent uniform random variables, and thus 
the distance between the first longest one and the second longest one follows 



fi{dx) = -5i{dx) + 



l[o,i)(a;)da;, 
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approximately the uniform law on [0..n]. The distance Z?„ is of course closely 
related to the length of the standard right factor. We can distinguish two cases: 

• If the word obtained from the Lyndon word by deleting the first "a" is 
still a Lyndon word, then the length of the standard right factor is equal to 
n — 1 ~ n, and this happens with a probability close to 1/2 (this probability 
is replaced by 1/g in the case of an alphabet with q letters); 

• Otherwise, the standard right factor begins by the second longest run of 
"a" . In this case, the length of the standard right factor is equal to n minus 
the distance Z?„, and is then approximately uniformly distributed on [0..n]. 

To prove that D„ is approximately uniformly distributed on [0..n], we cut a 
random word with length n into distinct "long blocks" with length of order log2 n 
(section^}, in such a way that the long runs of "a" are at the beginnings of the long 
blocks. Then we prove that the uniform distribution on Lyndon words is invariant 
under uniform permutation of these blocks (sectional)- Thus the positions of the 
two smallest (for the lexicographic order) long blocks are approximately uniformly 
distributed among all the possible positions of the long blocks. As n goes to infinity, 
the number of long blocks tends to infinity and their lengths are negligible when 
compared to n. This leads to our main result, Theorem 15.41 which says that the 
distance between the two smallest (for the lexicographic order) long blocks, divided 
by the length n of the word, follows asymptotically the uniform law on [0, 1]. In 
sectional we rephrase this result in terms of standard right factor and finally we 
generalize, in section the obtained results to the case of an alphabet with q 
letters. 

2. Random words and random Lyndon words 

Let A = {a, b} be an ordered alphabet (a < b) and A'^ be the set of all words 
with length n. If w G A", write w = {wi, . . . , w„) and define: 

TW = {W2, . . . ,Wn,Wi). 

Then < t {Id, r, . . . , r"^^} is the group of cyclic permutations of the letters of 
a word with length n. A word w G A" is called primitive if 

{3k G {0, 1, . . . , 71 - 1} such that t'^w = w) =^ (fc = 0). 

Denote by Vn the set of primitive words in A" , by J\fn its complement. 

Remember that a Lyndon word with length n is a word in A" which is strictly 
smaller, for the lexicographic order, than each of its proper suffix: it is equivalent 
to say that a word w with length n is a Lyndon word if and only if it is strictly 
smaller for the lexicographic order than every t^w with k G {l,...,n — 1}. We 
denote by the set of Lyndon words with length n. 

The group < r > of cyclic permutations acts on A^ , and Vn and are stable 
under this action. Each orbit associated to a primitive word w contains exactly n 
distinct words, and a unique Lyndon word, denoted by p(w), which is the smallest 
word in the orbit for the lexicographic order: the application p is then the canonical 
projection of P„ on £„ associated to the action of < t >. 

Example 2.1. 

• If w ~ aabaaa, then p{w) = aaaaab. 

• If A — {aab,abb} then p^^{A) — {aab,aba,baa}U {abb,bab,bba} . 
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As the set Mn of non-primitive words contains no Lyndon word, we have, by the 
shepherd's principle, that: 

card{Vn) ~ n x card(Cn)- 

Via the relation card(Vn) ~ ^"^''/^('^)' 'where /x is the Mobius function (see 

the book by Lothaire ^HDj we arc lead to: 

on 

card{Cn) = — (1 + 0(2-"/2)) card{Mn) = 0(2"/2). 
n 

In the sequel, we will consider the two following probability spaces: 

• the set of words with length n, endowed with the uniform probability P„, 

• the set Cn of Lyndon words with length n, endowed with the uniform 
probability P„. 

The probability measure P„ can be seen as the conditional probability on 
given £„. The next lemma is obvious, but it is very useful in our proofs because it 
allows to transfer results on random words to random Lyndon words by neglecting 
non-primitive words and using the shepherd's principle. 

Lemma 2.2. For A C £„, we have: 

ip;,(A)-p„(p-i(A)) i<o(2-"/2). 

Proof. It is sufficient to note that P„(A) = ¥„{p-^{A)\Vn) and P„(P„) = 1 - 
O (2-"/2) . 



3. Number of runs and length of the longest run 

This section deals with the number of runs and the length of the longest run 
in a random Lyndon word. The results exposed in this section are not new, but 
are presented in a convenient way for our proofs. The method is to get results for 
random words, and to transfer them to random Lyndon words via lemma 

Definition 3.1. Let w be a word in We denote by Nniw) the number of runs 
in w, by Xi(w), X2{w), . . . , Xpf^{w) their lengths, by Mn{w) ~ Taax{Xi{w), 1 < 
i ^ Nniw)} the length of the longest run in w and by M^{w) the length of the 
longest run of "a" in w. 



Example 3.2. n = 9 
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w = aabbbbaaa 


3 


2,4,3 
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p{w) = aaaaabbbb 


2 
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Lemma 3.3 (Number of runs). For every 7 > 0, there exists a constant C-y > 
such that for any s > 0, 



2 



O {cxp {-C-yU^'')) 



Proof. First, we prove the above inequality for the probability measure P„ on the 
set A^ of words with length n. The cardinal of the event {N„ = k} corresponds 
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to the number of eompositions of the integer n with k parts (see Andrews and 
Pitman [E]): 

1 /r7 — 1 

VfcG {l,2,...,n}, P„(7V„ = fc) 



2"-i \k - 1 

Thus Nn — 1 is a binomial random variable with parameters (n — 1 , 1/2) , whose large 
deviations are well known (see the book by BoUobas [SJ Th. 7, p. 13] for instance): 
there exists a positive constant C!y such that 

P„(|a^„-^| >7"-^/'+^) < 0(cxp(-CX^')). 

To obtain the same inequality for the probability measure P„ on the set £„ of 
Lyndon words with length n, note that for a primitive word w, we have Nn^w) — 1 < 
Nn{p{w)) < Nn{w). Thus, using Lemma [2. 21 we obtain the announced result. ■ 

The next step is to study the length of the longest run of a word if S A". For 
this mean, we will use the following construction of the uniform probability measure 
on the set of all infinite words on A: 

Construction 3.4. Let (Zi)igN be independent identically distributed geometrical 
random variables with parameter 1 /2 defined on a probability space (f2, P) and let 
e be a Bernoulli random variable with parameter 1 /2 defined on and independent 
of the Zi 's. To obtain a infinite random sequence of a and b, do the following: 

• if e = 1, write Z\ "a", followed by Z2 "b", followed by Z3 "a" and so on... 

• if e = 0, write Z\ "b ", followed by Z2 "a ", followed by Z3 "b " and so on. . . 

Truncating to keep the n first letters gives a random variable defined on and 
uniformly distributed on Thus, in this setting, the number of runs is: 

Nniiu) = inf jfc e N, ^ Z,(cj) > n| , 

and the lengths of runs are: 

JV„-1 

(1) V2 e {1, . . . , Nn - 1}, = Z, and XN„=n- ^ ^^n- 

i=l 

We denote by log the natural logarithm, by log2(a;) = logx/log2, by [a;] the 
smallest integer larger than x and by [x\ the largest integer smaller than x. The two 
next lemmas give estimates for the length of the longest run in a random Lyndon 
word. These estimates are related with the extreme values theory (see the books 
by Bingham, Goldie and Teugels @], Resnick ^31 or the work of Hitzcnko and 
Louchard or the initial works of Erdos and Renyi [7j and Erdos and Rcvcsz |5|). 

Lemma 3.5 (Longest run, small values). For any e > 0, 

P„ {Mn < (1 - e) log2 n)<0 (^exp (^-^ 

Proof. First, wc prove the above inequality for the probability measure P„ on the 
set A" of words with length n. The idea is that the number of the runs of "a" in a 
random word with length n is highly concentrated around n/4 and that it is easy 
to estimate the maximum of n/4 independent geometrical random variables. 
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Note that Af„ > Af°. Using |^ and lemma rOI with 7 = 1, we have: 

^n{M.n < y) 

< F„ (iV„ ^ 5 (1 - ""^+')) + Pn (^^ < 2/, 1 < i < J (1 - n~i+' 

< F„ (7V„ < f (1 - n-'^'+')) + P„ (Z, < 1 < z < J (1 - 



< 



O (cxp (-Cm^^')) + (1 - 2L-2'J)Lf (i-»"''+lJ. 



To hghtcn notations, we consider the j (^1 — n first Z^'s rather than the Z^'s 

corresponding to runs of "a" , which would have obliged us to distinguish whether 
the words begins with "a" or "6". Taking y = (1 — e) logj n, we obtain easily: 

P„ (M^-J < (1 - e) log2 n)<0 (^exp (^-| 

To obtain the same inequality for the probability measure P„ on the set £„ of 
Lyndon words with length n, note that, for a primitive word w, we have M^^w) < 
M^{p{w)). We can now use lemma to conclude. ■ 

Lemma 3.6 (Longest run, large values). For any 1 < A < 2, 

P„ {M^, > Alog2 n) < P„ (M„ > Alog2 n) < O (n^-^) . 

Proof. As before, we begin by proving the above inequality for the probability 
measure P„ on the set .4". Note that we still have M„ > M°. Thus for y > 0, we 
have: 

F„(Af,'J <y) > P„(A/„ < y) 

= F„(Vz e {!,..., 7V„}, X, <2/) 

(2) > P(V^e {l,...,Ar„}, <y) 

(3) > F(Vje {l,...,n}, Z, <2/) 

> (l-2"LaJ)". 

Inequality (0) hold because of 1^ and inequality ^ because Nn < n. Taking 
y = Alog2 n, we obtain easily the announced upper bound. 

To come back to Lyndon words, note that for a primitive word it;, we have 
M„(p(w)) < max{M„(u'), Thus we obtain: 

P„(M„(p(i(;)) > Alog2n) 

< P„ {M,,{w) > A log2 n) + P„ {Xi + Xn„ > A log^ n) 

< P„ (A log2 n > M^{w)) + P„ [Xi > 4 log2 n) + P„ (xn„ > 4 logs " 



Note that Xi and X^^ have the same law. Thus using we get: 

P„ (^X, > I log2 < F„ (^Zi > I log2 = 0(n-4) ^ « , 

by the choice we made for A. Finally, by using Lemma 12.21 we get the desired 
result. ■ 
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4. Building of long blocks and short blocks of a word 

Let < £ < 1 and _B > 2 be fixed in this section. Our aim here is to find, 
in a word w € A", some long blocks beginning by a long run of "a" in a word 
w G and we moreover want to choose them long enough to be distinct with 
high probability. We study then the positions of these long blocks along the word 
w. Here is our definition: 

Definition 4.1 (Long blocks). Let w be a word with length n. The long blocks of 
w are the subwords of w that: 

• begin with a run of "a" with length equal or greater than (1 — e) logj n, 

• end with a run of "b" 

• have the smallest possible length larger than 31og2n. 
We denote by Hn the number of long blocks. 

The next lemma estimates the number of long blocks for a random Lyndon word. 
Note that although the crude estimate we give could be sharpen, it is sufficient for 
our mean. 

Lemma 4.2 (Number of long blocks). There exists a constant D > such that 
1 9 \ 

-n" <Hn<-n^j > l-O (exp (-Dn^)) . 

Proof. We begin once again by proving the inequality for the probability measure 
P„ on the set A"'. Set, for i > I, Bi = l{Zi>(i-e)iog, «}• Then (-Bi)i>i are indepen- 
dent identically distributed Bernoulli random variables with parameter Pn,e: which 
satisfies < pn,e < 2n^~^. Note that 

n 

l<2i-l<Af„-l i=l 

n 

J2 B2^{w) <Hn{w) <Y,B^{W) if = 6. 

l<2i<Af„-l i=l 

Therefore, by large deviation results for sums of independent Bernoulli random 
variables (see for instance the book by BoUobas Th. 7, p. 13]), there exists 
Di > such that: 



< 



B, - nEBi > -nA < O (exp {-Din")) . 
\i=i J 



In the same manner, by looking only to the iJ^'s with odd indices (when the word 
begins with "a") or only to the i?i's with even indices (when the word begins with 
"&") and using lemma in which 7 = 1, we obtain the existence of D2 > such 
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that: 




< 



< 0{cxp{-Cin^')) +0{cxp{-D2n'')). 

This proves the lemma for random words. 

For random Lyndon words, note that if w is a primitive word, then Hn{w) < 
Hn{p{w)) < Hn{w) + 1 and use lemma UT^ ■ 

The length of the long blocks has been chosen large enough to ensure that two 
long blocks are distinct with high probability: 

Lemma 4.3 (Inequality of long blocks). Denote by the event that a word with 
length n has at least two equal disjoint subwords with length at least 31og2 n. Then: 

Proof. We begin as usual with random words. By counting the number of possible 
subwords with length 3 log2 n and their possible positions, we have: 

Lemma [2.21 gives the same estimate for Lyndon words. ■ 

We also want that the long blocks do not overlap with high probability, or, in 
other words, that the beginnings of long blocks are far away enough with high 
probability. This is ensured by the next lemma: 

Lemma 4.4 (Minimal distance between beginnings of long blocks). Let Dn be the 

event that there exist at least two long blocks which begin at a distance less than 
81og2n. Then: 

P„(i?„) < 0(n-(^-2e) iQg^ 

Proof. As usual; we start with the case of random words: 

¥„{Dn) < P„ (^Hn > ^n'^ + P„ (^Dn n |i/„ < ^n' 

Let us denote by F„ the last event. On Fn, at least one of the Hn subwords with 
length 81og2n starting just after a run of "a" with length at least (1 — e) log2 n 
must admit a subword of "a" with length (1 — e) log2 n (which is the beginning of 
the next long block). By an estimate analogous to the one used in the previous 
lemma, 

P« (Fn) < o|\"81og2n2-(i~^)'°S2"| =0(n-(i-2^) log2 n). 

By lemma the first term is negligible, and Lemma [2.21 concludes for Lyndon 
words. 

Now we consider the set of "good" Lyndon words that satisfy all the previous 
properties: 



STANDARD RIGHT FACTOR OF A LYNDON WORD 



9 



Definition 4.5 (Good Lyndon words). Denote by Qn the set of Lyndon words w 
satisfying the following conditions: 

• the maximal run of "a " satisfies (1 — e) log2 n < M° < 2 logj n 

• the maximal run satisfies (1 — e) logj n < < 2 logg n 

1 9 ^ 

• the number of long blocks satisfies — n"^ < iJ„ < ^"-^Z 

• the beginnings of long blocks are at a distance at least 8 logj n, in the sense 
Qn C Dl; 

• the word w has no equal long blocks, in the sense Qn C -Bfj. 

Note that on Qn, the length of a long block is less than 3 logj n + 2 x 2 log2 n = 
7 log2 n, and that there is no overlapping between two long blocks . The next lemma 
ensures that a large proportion of Lyndon words are good Lyndon words: 

Lemma 4.6. For every n large enough: 

^n[Qn) > 1 - O (n-(l-2e) 

Proof. Everything has been proved in the previous lemmas 13.51 13.61 (in which 
A = 2), 11211121 and 1131 ■ 

Now, wc note that a good Lyndon word w & Qn begins with a long block, ends 
with a run of "5" , and all portions between long blocks begin with a run of "a" and 
end with a run of "6". Wc can thus give the following definition of short blocks: 

Definition 4.7 (Short blocks). For a good Lyndon word w £ Qn, we cut each 
section stretching between two long blocks into short blocks, made of two consecutive 
runs of "a" and "b"(in this order). 

Note that short blocks have length equal or smaller to 41og2 n. 

5. Permutations of blocks for good Lyndon words 

In the previous section, wc have cut any good Lyndon word w into blocks be- 
ginning with a run of "a" and ending with a run of "6" : the long ones and and the 
short ones. The long ones correspond to long runs of "a" , and the first long block 
(at the beginning of w) is, by definition of a Lyndon word, the smallest block for 
the lexicographic order. We are going to see that we can keep this first long block 
of w at the beginning of the word and permute the other blocks, without changing 
the distribution on the set of good Lyndon words. 

In the following, "short" and "long" refer to the type of blocks, while "small" 
and "large" refer to the lexicographic order on words. 

Definition 5.1 (Permutation of blocks for good Lyndon words). Consider w £ Qn- 

1. We denote by Kn{w) the total number of blocks, long and short, of w. 

2. We denote by (yi(w))o<i<if„(u))-i the blocks of w in their order of appearance 
along w. Certainly, the first block Yq(w) is the smallest block among all blocks of w. 

3. Let jo{w) be the index of the second smallest block of w. 

4- We denote by 6/^^(„,)_i the set of permutations of {I, ... , Kn{w) — 1}, and 
define 

a.w = Yo{w)Y^(^i^{w) . . . Y„t^K„{w)-i){'w), 
for a £ 6k„(w)-i- Obviously, a.w G Qn- 

5. We define also C{w) = {a.w, a € &k,^{w)-i}, the set of all words which are 
obtained by the all the permutations of the blocks of w. 
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The two cases of right factor exposed in the introduction can be rephrased in 
the following manner: either the standard right factor is obtained by deleting the 
first "a", or it begins by the second smallest block of w, Yj^i^^jy 

In this section, we study the asymptotics of the position between the two smallest 
blocks, and we will rephrase this result in terms of standard right factor in the next 
section. 

Our main tool is the immediate following property: 

Lemma 5.2 (Invariancc in law under the permutations of blocks). Let wq be a 
fixed good Lyndon word. Consider the set S/f^(„,-,)_i x C{wo), endowed with the 
uniform probability. Then the random variable 

Wu,a ■■ &K„{wo)-l X C{wo) > C{wo) 

{a, w) ' — > a.w 

follows the uniform law on C{wo). 

Proof. It is sufficient to note that, by construction, each word in C{wo) has the 
same family of blocks. ■ 

Thus, roughly speaking, the second smallest block Yj^ has the same probability 
to be at every possible place among all the blocks, and this is why its position 
along the word w, divided by n, should follows approximately the uniform law on 
[0, 1]. To formalize this intuition and to exploit this invariancc property, we enlarge 
our probability space Gn- consider a sequence (J7i)igN of independent identically 
distributed random variables on a probability space {X, X,Q), following the uniform 
distribution on [0, 1]. We denote by IP„ the uniform probability on Qn and consider 
the product probability Q P„ on the product space X x Qn, this means that 
(C/i)igN are independent of the choice of the random Lyndon word in C/„. 

Definition 5.3 (Random permutation). For x G X , we define a uniform random 
permutationiT X G 6a'„(«;)-i the order statistics ofUi{x),U2{x),...,UK„[w)-i{x)- 

Therefore from the previous lemma, the random variable 

[ [X,W) I > TTx-W 

follows the uniform law on We can now study, under the uniform probability 
P„ on Qn, the position of the second smallest block defined by 

dniw) = - ^ \Y{w)\. 
n ^ — ' 

i=o 

Here \w\ denotes the length of the word w. Thus, the random variable 

dr,oW ■ Xxg,, [0,1] 

(X, W) I > dn{TTx-w) 

has the same law as the random variable dn under P„. 

We will thus focus on this new random variable to use the property of invariancc 
under the permutation of blocks. Remember that the convergence in L^ implies 
the convergence in probability; thus, in the following, the notation will denotes 
the L^-norm associated to a probability measure P. 
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Theorem 5.4 (Position of the second smallest block). We have: 



\dnoW - U. 



jo 



< o 



log2 n 



l/2\ 



This implies in particular that the law of (i„ under P„ converges weakly to the 
uniform law on [0, 1] and that every moment of c?„ converges to the corresponding 
moment of the uniform distribution. 

Remark. Coming back to random words, this result implies that the normalized 
distance between the two smallest blocks (which roughly corresponds to the two 
largest runs of "a") asymptotically follows the uniform law on [0, 1]. 

Proof. We have: 



dnOW{x,w) = dn{7Tx-W) 



-\Yo{w)\ + - J2 I^jHI1{c/.(^)<c/,o(™)(-)} 



By conditioning on w and Uj^^y^), and using the fact that \Yi\ = n, we obtain: 



( 



\ 



-\Y,{w)\ + - 
n n 



V 



1 < J < K„(w) - 1 
3 ¥= io{w) 



{Uja(w) + (1 - t^jo(«.))- 



Yo{w)\ 



u. 



Jo(w) ' 



I 



On Qn, 1^0 1 and \Yj^\ are bounded by Tlogj n, so 



E, 



1 - f7. 



u. 



io (•!")" 



\Yoo{w){w)\ 



< 



'l41og2n" 
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which tends to when n goes to infinity. Now, 

\\dn{TTx-w) -Eq^jp^ {dn{Trx-W)\w,Ujg(^^))\ 



E, 



E 



E 



\Y,{w)\ 



U. 



1 < j < K„(w) - 1 
3 ¥= jo{w) 



E 



E 



l>S-HI 



1 < j < K„{w) - 1 
3 ^ 3a{w) 



(4)< E, 

7 log2 n 



E 



j ^ jo{w) 



\Yj{w)\ 



(5)< 



in 



which tends to when n goes to infinity. To obtain inequahty |0J, we conditioned 
first on w and Ujg(^^y, for inequahty we used the facts that, on all blocks 
have length smaller than Tlogj n and that J2i 1^*1 = ^■ 
Consequently, 



30 llQg 



< \\dninx.w) -Eq^p^ {dn{nx.w)\w,Ujg{w)) 
|Eo«p {dn{'!rx-w)\w,Uj„{w)) - C/,o(.ii,)|| 



< o 



"l0g2"- 



1/2N 



The convergence of the other moments is a consequence of the convergence in law, 
as dn is bounded by 1. ■ 



6. Limit distribution of the standard right factor 

In this section, we establish the convergence of the distribution of the normalized 
length of the standard right factor of a random Lyndon word and give the limit 
distribution, which follows quite easily from the result of the previous section. 
Remember that P„ is the uniform probability on the set £„ of Lyndon words with 
length n and that P„ is the uniform probability on the set Qn of good Lyndon 
words with length n. The length of the standard right factor of w S £„ is denoted 
by Rn{w), and we introduce the normalized length of the standard right factor 
r„(w) = Rn{w)/n. 

Theorem 6.1. As n goes to infinity, r„ converges in distribution to 
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where Si denotes the Dirac mass at point 1, and dx Lehesgue's measure on R. 
All the moments of rn also converge to the corresponding moments of the limit 
distribution. 



Proof. First, we split the set £„ in two parts, corresponding to the two cases of 
the introduction: 

• = aCn-i contains exactly the Lyndon words w whose standard right 
factor is obtained by deleting the first "a" of the word and has thus nor- 
malized length rn{w) ~ {n ~ ^)/n. Note that 

card(Cn-i) 1 
card{Cn) 2 

• C\ ~ Cn\C\ contains exactly the Lyndon words w whose standard right 
factor has normalized length rn{w) strictly smaller than [n — l)/n. 

Now, forgetting the "bad" Lyndon words, using the inequality r„ < 1 and lemma 
14.61 we obtain the following inequality: 



(6) 



But for w £ Cf^ n Qn, the standard right factor begins with the second smallest 
block Yjg of w. Thus, in this case, with the notations of the previous section: 
rn{w) = l — dn{w). Moreover, Cl^DQn and C'^OQn are stable under the permutations 
of blocks. Thus with the same setting as in Theorem 15.41 

(1 - d„(7r^.w))l£2 (w) + -Ic^ (w)] lg„(w) 



rn{x,w)lQ^ (w) 



law 



(7) = r„{w)lg„{w), 

where the right hand side is a random variable from X x £„, endowed with (Q 
P„ and the left hand side is from £„, endowed with the uniform probability I 
Keeping in mind the result of the previous theorem, wc introduce for (x, w) 
X X £„, 

Sn{x,w) = ((1 - t/,o(^)(a;))l£2 (u;) + lcl^{w)) le„(w). 
Now, by Theorem 15.41 

\\rn{x,w)lg^{w) - Snix,w) 

\rn{x,w)lg^{w) - Sn{x,w) 



< 
< 

< 



1 



{r„{x,w) - (1 - ?7,„(„)(.t))) IciM l£^H 



(8) 



< o 



l/2^ 



Note that the position ofYj^, the second smallest block of w in tTx-w, is governed by 
Ujg, which is clearly a uniform random variable on [0, 1], independent of w. Note 
also that, thanks to lemma Ol P„(£i^ n Gn) ~ 1/2, and then P„(£^ n Gn) ~ 1/2: 
consequently, the law of s„ under Q ® P„ converges weakly to ^{dx) = ^[Si{dx) + 
l[o,i]{x)dx]. 
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Now, as \\rn{x,w)lg^{w) — s„(a;, w)||q^p^ goes to by (jHJ, a classical result (see 
for instance the book by Billingsley |2| Th. 4.2, p. 25] in the first edition) ensures 
that the distribution of r„lc;^, as a random variable on X x Cn, also converges 
to ix{dx) = ^[5i{dx) + l[o.i](x)(ia;]. Using Q, the distribution of r„lg^, as a 
random variable on £„, also converges to the same limit. Finally, © ensures the 
convergence of the distribution of r„ to the same limit. H 

7. Generalization to the case of q letters 

In this section, wc generalize the previously obtained results to the case of a 
totally ordered alphabet with q letters: A — {ai, 02, . . . , flg}, q £ {2, 3, 4, ...} and 
ai < 02 < ■ • • < Oq. All the technics developed for the simple case of two letters can 
be readily adapted in this context and we just give the results and some indications 
for the adaptations needed. 

1. Denote by the set of words with length n and by Cn = Cn{{ai, 0,2, ■ ■ ■ , Oq}) 
the subset of Lyndon words. The probability measures P„ and P„ are defined as 
before. As previously, we have: 

n 

card{Cn) = ^{l + 0{q-^/^)). 
n 

The link between random Lyndon words and random words still holds: if A C Cn, 
we have: 

\rn{A)-Fn{p-\A)) \<0{q-"/^). 

2. Let w be a word in An- As previously, we define its runs, its number of runs 
Nn{w) and the length of these runs Xi{w), . . . ,Xn^{w). To build these random 
variables, we introduce a family (Zi)igN of independent identically distributed geo- 
metrical random variables with parameter {q — l)/q defined on a probability space 
(f2,^, P), and (ei)igN, a family of independent and identically distributed random 
variables with uniform distribution on {ai, 02, a^}, and independent of the Z^'s. 
To obtain a random sequence of letters, do the following: 

• Select the letter ei, and write a run of Zi such letters. 

• Select the letter £2 conditioned to be distinct of ei, and write a run of Z2 
such letters. 

• Proceed by recurrence: select the letter e„+i conditioned to be distinct of 
e„, and write a run of Zn+i such letters. 

Truncating the n first letters gives a random variable defined on O and uniformly 
distributed on As previously, 

Nn{Lo) inf jfc e N, ^ Z.,{uj) > n| . 

3. Estimate the number of runs by using the fact that Nn — 1 follows a binomial 
law with parameters — 1, as in lemma IX^ 

4. Estimate the length M„ of the longest run and the length M^^ of the largest 
run of fli as in lemma rTKl and lemma imi bv using the same estimates on geometrical 
laws. The typical order of M„ and Af is log^ n. 

5. Define the long blocks: 

Definition 7.1. Let w be a word with length n. The long blocks of w are the 
subwords of w that: 
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• begin with a run of "ai " with length equal or greater than (1 — e) log2 n, 

• end just before an other run of "ai " ( and consequently end with a run of a 
letter distinct from "ai " 

• have the smallest possible length larger than Slogan. 

Their number H„ is, as in lemma [4.21 of order n'^. To prove this, introduce, 
for i > 1, the variable Bi = l{Zi>(i-e) log n}- Then the (-Bi)i>i are independent 
identically distributed Bernoulli random variables with parameter pn_e satisfying 
n^~^ < p„ £ < qn'^~^, thus we can have large deviation results. 

6. We verify then that the long blocks do not overlap too often and are distinct 
with high probability, as in lemmas 14.31 and 14.41 Good Lyndon words are defined 
in the same manner as previously. Define the short blocks: 

Definition 7.2. For a good Lyndon word w G Qn, we cut each section stretching 
between two long blocks into short blocks, that begin with a run of 'ai " and end just 
before the next run of "oi ". 

7. All is thus in place to permute the blocks as previously. With the same setting 
as before, we obtain: 

Theorem 7.3. We have: 

'"log,ml/2^ 



This implies in particular that the law of (i„ under P„ converges weakly to the 
uniform law on [0, 1] and that every moment of d„ converges to the corresponding 
moment of the limit law 

8. To conclude for the length of the right factor, we split the set £„ in two parts: 

• £^ = ai£„_i({ai, 02, . . . , aq})Ua2Cn-ii{a2, ag})U- ■ •Ua5_i/:„_i({a,_i, 
contains exactly the Lyndon words w whose standard right factor is ob- 
tained by deleting the first letter of the word and has thus normalized 
length rn{w) = (n — l)/n, 

• ~ C'^\C'^^ contains exactly the Lyndon words w whose standard right 
factor has normalized length r„(ti;) strictly smaller than (n — ^)/n. 

The only difference is that 

p ^ card{Cn-ii{ai,a2, ■ ■ ■,aq})) H h card{Cn-i{{aq^i,aq})) 1 

cara[Ln) q 

which gives Theorem ll.il 
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